Method for performing condition based data acquisition in a hierarchically distributed condition based maintenance system

ABSTRACT

A method for accumulating fault condition data from a hierarchical system is presented comprising monitoring an operation of a component with a computing node that includes a processor and a memory. The memory contains a configuration file comprising failure modes (FM), symptoms, tests that identify the symptoms and a corrective action for the symptom. The method further comprises populating at least one of the processor and the memory of the computing node with one or more standardized executable application modules (SEAM) and a workflow service. The one or more SEAMS is configured to create a fault condition record by collecting all FMs that manifest the symptom. For all FMs collected, a list of unique symptoms is produced. Further all tests that can identify the listed unique symptoms and all of the corrective actions associated with the FMs that manifest the symptom are determined from the memory device.

TECHNICAL FIELD

The instant application contains subject matter related to subject matter disclosed in other co-owned, co-pending applications. As such, each of co-owned, co-pending application Ser. Nos. 13/016,601, 13/077,276, 13/115,690, 13/572,518 and 13/273,984 have been incorporated herein as reference in their entirety.

TECHNICAL FIELD

The present invention generally relates to fault model architectures for condition based health maintenance systems monitoring a complex system, and more particularly relates to a method for accumulating data from a hierarchical monitoring system to create fault condition records for otherwise newly experienced symptoms in real time under certain conditions. Such a system is particularly advantageous in situations where a fault condition cannot be diagnosed or identified manually or with conventional systems until the system is corrected and the system cannot be corrected unless the fault condition becomes known.

BACKGROUND

Increases in vehicle complexity and the accompanying increase in maintenance costs have led to industry wide investments into the area of condition based health management (CBM). These efforts have led to the development of industry or equipment specific process solutions. However, conventional CBM systems are generally rigidly configured requiring the user to live with cumbersome performance or pay significant modification costs.

FIG. 1 is a simplified block diagram of an exemplary multi-level health maintenance process 10 that may be useful in monitoring the complex system (not shown). A complex system as discussed herein may be any type of vehicle, aircraft, manufacturing process, or machine that may utilize sensors, transducers or other data sources to monitor the various components and parameters of the complex system. The sensors/transducers are typically situated at the component or the process measurement level 20 to measure, collect and communicate raw data through a variety of data driven input/output (I/O) devices. This raw data may represent fault indicators, parametric values, process status, operating mode, and events, consumable usage and status, interactive data and the like. Non-limiting examples of other data sources may include serial data files, video data files, audio data files and built in test equipment.

Once the parameters of the complex system are measured, the measurement data is typically forwarded to more sophisticated devices and systems at an extraction level 30 of processing. At the extraction level 30, higher level data analysis and recording may occur such as the determination or derivation of trend and other symptom indicia.

Symptom indicia are further processed and communicated to an interpretation level 40 where an appropriately programmed computing device may diagnose, prognosticate default indications or track consumable usage and consumption. Raw material and other usage data may also be determined and tracked. Conventionally, diagnosis, prognostication and default indication is typically determined by applying detected parameters to a large centralized fault model 41, where data, processed data, or fused data is analyzed.

Data synthesized at the interpretation level 40 may be compiled and organized by maintenance planning, analysis and coordination software applications at an action level 50 for reporting and other interactions with a variety of users at an interaction level 60.

Although processes required to implement a CBM system are becoming more widely known, the level of complexity of a CBM system remains high and the cost of developing these solutions is commensurately high. Attempts to produce an inexpensive common CBM solution that is independent from the design of the complex system that is it is to monitor have been less than satisfying. This is so because the combination and permutations of the ways in which a complex system can fail and the symptoms by which the failures are manifested are highly dependent on the system design and the sophistication of the centralized fault model 41.

Conventionally, if additional data is required to determine the cause of a fault, a technician would manually author a data collection specification (i.e., subroutine) and upload it to the CBM system to collect data the next time the fault occurs. However, in some cases the data needed to diagnose the fault cannot be recreated until the fault is corrected. Thus, there exists a catch 22 scenario where technicians cannot fix a problem without additional data; and, the data cannot be obtained without first fixing the problem. One method to address this problem is to record all information at all times. Although such a scheme merely generates an overabundance of useless information that must be processed, stored and transmitted over a bus with finite bandwidth. However, there are certain failure modes for which the CBM node that monitors the asset experiencing the failure mode, or other CBM nodes, can obtain additional data in real time that will help identify the root cause of the failure and the corresponding corrective action. This provides higher fidelity diagnostic information than would be available with only the reported symptoms (aka fault codes, maintenance codes, diagnostic trouble codes, etc.). Such ancillary data collection is done “on condition” in order to reduce the data collection overhead, network bandwidth, CPU time, memory and other system resources. It captures the right data at the right time.

Accordingly, it is desirable to develop a health maintenance system architecture that is sufficiently flexible to support a range of complex systems. In addition, it is desirable to develop a health maintenance system that may be easily reconfigured by a user to allow ancillary data collection when a fault occurs and to automatically determine necessary supplemental data to be collected based on detected symptoms. Such a system dispenses with prohibitive reprogramming costs and delays. Furthermore, other desirable features and characteristics of the present invention will become apparent from the subsequent detailed description of the invention and the appended claims, taken in conjunction with the accompanying drawings and this background of the invention.

BRIEF SUMMARY

A condition based method for accumulating fault condition data from a hierarchical monitoring system is provided. The method comprises monitoring an operation of a component of a complex system with a computing node that includes a processor and a memory device, the memory device containing a configuration file, the configuration file comprising failure modes (FM), symptoms manifested by the failure modes, tests that identify one or more of the symptoms and a corrective action for the failure mode. The method further comprises populating at least one of the processor and the memory of the computing node with one or more standardized executable application modules (SEAM) and a workflow service. The one or more SEAMS are unable to communicate directly with each other. Further, one of the one or more SEAMS is configured to create a fault condition record by determining all FMs that manifest the symptom of the new fault condition from the memory device, examining a data collection specification associated with each of the FMs determined to identify each determined FM that requires supplemental fault condition data from the remote node, and creating a data collection request message to the remote node. The method further includes receiving supplemental data requested in the data collection request message from the remote node, and incorporating the supplemental data into the fault condition record.

A non-transient computer readable storage media (CRSM) resident within a computing node containing instructions for one or more standardized executable application modules (SEAM) that when executed perform acts to create a fault condition record for a symptom. The instructions include determining all FMs that manifest the symptom of the new fault condition from a memory device, examining a data collection specification associated with each of the FMs determined to identify each determined FM that requires supplemental fault condition data from a remote node, and creating a data collection request message to the remote node. The instructions also include receiving supplemental data requested in the data collection request message from the remote node, and incorporating the supplemental data into the fault condition record.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will hereinafter be described in conjunction with the following drawing figures, wherein like numerals denote like elements, and

FIG. 1 is a simplified block diagram of an exemplary multi-level health maintenance process;

FIG. 2 is a simplified functional block diagram for embodiments of hierarchical structure;

FIG. 3 is a simplified schematic of an exemplary reconfigurable system to optimize run time performance of a hierarchical condition based maintenance system;

FIG. 4 is a flow diagram of an exemplary method for configuring/reconfiguring a hierarchical structure of computing nodes that are monitoring various components of the complex system;

FIG. 5 is a simplified block diagram of an exemplary computing node;

FIG. 6 is another simplified block diagram of an exemplary computing node;

FIG. 7 is a simplified logic flow diagram of an exemplary method for coordinating functions of a computing device to accomplish a task according to embodiments;

FIG. 8 is a simplified example of Conditioned Based Data Acquisition;

FIG. 9 is a flow chart illustrating an exemplary condition based method for accumulating fault condition data from a hierarchical monitoring system; and

FIG. 10 is an exemplary subroutine executed by an Allocate SEAM for creating a fault condition and accumulating fault condition data from a hierarchical s monitoring system.

DETAILED DESCRIPTION

The following detailed description is merely exemplary in nature and is not intended to limit the invention or the application and uses of the invention. As used herein, the word “exemplary” means “serving as an example, instance, or illustration.” Thus, any embodiment described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments. All of the embodiments described herein are exemplary embodiments provided to enable persons skilled in the art to make or use the invention and not to limit the scope of the invention which is defined by the claims. Furthermore, there is no intention to be bound by any expressed or implied theory presented in the preceding technical field, background, brief summary, or the following detailed description.

Those of skill in the art will appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. Some of the embodiments and implementations are described below in terms of functional and/or logical block components (or modules) and various processing steps. However, it should be appreciated that such block components (or modules) may be realized by any number of hardware, software, and/or firmware components configured to perform the specified functions. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps are described herein generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention. For example, an embodiment of a system or a component may employ various integrated circuit components, e.g., memory elements, digital signal processing elements, logic elements, look-up tables, or the like, which may carry out a variety of functions under the control of one or more microprocessors or other control devices. In addition, those skilled in the art will appreciate that embodiments described herein are merely exemplary implementations.

The various illustrative logical blocks, modules, and circuits described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a controller, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration. The word “exemplary” is used exclusively herein to mean “serving as an example, instance, or illustration.” Any embodiment described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments.

The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal. Processors in all of their forms that currently exist or that will exist in the future, as well as storages media in all of their forms that currently exist and that will exist in the future, are considered herein to be computer readable storage media.

In this document, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Numerical ordinals such as “first,” “second,” “third,” etc. simply denote different singles of a plurality and do not imply any order or sequence unless specifically defined by the claim language. The sequence of the text in any of the claims does not imply that process steps must be performed in a temporal or logical order according to such sequence unless it is specifically defined by the language of the claim. The process steps may be interchanged in any order without departing from the scope of the invention as long as such an interchange does not contradict the claim language and is not logically nonsensical.

Furthermore, depending on the context, words such as “connect” or “coupled to” used in describing a relationship between different elements do not imply that a direct physical connection must be made between these elements. For example, two elements may be connected to each other physically, electronically, logically, or in any other manner, through one or more additional elements.

While at least one exemplary embodiment will be presented in the following detailed description of the invention, it should be appreciated that a vast number of variations exist. It should also be appreciated that the exemplary embodiment or exemplary embodiments are only examples, and are not intended to limit the scope, applicability, or configuration of the invention in any way. Rather, the following detailed description will provide those skilled in the art with a convenient road map for implementing an exemplary embodiment of the invention. It being understood that various changes may be made in the function and arrangement of elements described in an exemplary embodiment without departing from the scope of the invention as set forth in the appended claims.

FIG. 2 is a simplified functional block diagram for embodiments of hierarchical structure 200 that may be timely reconfigured by the user. This may be accomplished by altering a set of configuration data 180 via a data driven modeling tool 171, which also may be described as a model based configuration means. The configuration data 180 may be stored in a static data store (e.g. a ROM), a dynamic data store (e.g. RAM), or both 190

In light of the plethora of complex systems that may be monitored by the embodiments being described herein below and the wide range of functionality that may be desired at any point in the complex system, the following description contains non-limiting examples of the subject matter being disclosed herein. A specific non-limiting example of a complex system that may complement the following exemplary embodiments may be the vehicle as described in co-owned, co-pending application Ser. No. 12/493,750 to David Goldstein.

For the sake of brevity and simplicity, the present example will be assumed to have only five different processing levels or “application layers.” An Application Layer (120-160) is a set of functions or services programmed into run-time software resident in one or more computing nodes sharing a particular hierarchical level and which is adapted to meet the needs of a user concerning a particular health management implementation. As non-limiting examples, an application layer may be an Equipment Health Manager (EHM) Layer 120, an Area Health Manager (AHM) Layer 130, a Vehicle Heath Manager (VHM) Layer 140, a Maintainer Layer 150, or an Enterprise Layer 160.

However, in equivalent embodiments discussed herein, the hierarchical structure 200 may have any number of levels of application layers (120-160). Application layers (120-160) may include any number of computing nodes, which are computing devices. The number of nodes is determined by the complexity of the complex system and the sophistication of the monitoring desired by the user. In some embodiments, multiple nodes (120-160) may be resident in one computing device. The computing nodes of the equipment based layers (EHM Layer 120, AHM Layer 130, VHM Layer 140, Maintainer 150 and Enterprise 160) may be also referred to as an EHM 120′, an AHM 130′, a VHM 140′, a maintainer node 150′ and an enterprise node 160′.

In the exemplary embodiments disclosed herein, an EHM 120′ is a computing device that provides an integrated view of the status of a single component of the complex system comprising the lowest level of the hierarchical structure 200. The EHM 120′ may have different nomenclature favored by others. For example, in equivalent embodiments the EHM 120′ also be known as a Component Area Manager (CAM). A complex system may require a large number of EHMs (120′), each of which may include multiple times series generation sources such as sensors, transducers, Built-In-Test-Equipment (BITE) and the like. EHMs (120′) are preferably located in electronic proximity to a time series data generation source in order to detect symptomatic times series patterns when they occur.

An AHM 130′ is a computing device situated in the next higher hierarchical level of the hierarchical structure 200 and may receive and process message, command and data inputs received from a number of EHMs 120′ and other nodes 130′-160′. An AHM 130′ may report and receive commands and data from higher level or lower level components of the hierarchical structure 200. An AHM 130′ processes data and provides an integrated view of the health of a single sub-system of the complex system being monitored. The AHM 130′ may have different nomenclature favored by others. For example, in equivalent embodiments the AHM 130′ also be known as a Sub-system Area Manager (SAM). In some embodiments, an AHM 130′ may include a node specific DDR 45.

A VHM 140′ is a computing device situated in the next higher hierarchical level for the hierarchical structure 200 and may receive and process message, command and data inputs received from a number of EHMs 120′ and AHMs 130′. A VHM 140′ may report and receive commands and data from higher level components of the hierarchical structure 200 as well. A VHM 130′ processes data and provides an integrated view of the health of the entire complex system being monitored. The VHM 140′ may have different nomenclature favored by others. For example, in equivalent embodiments the VHM 140′ also be known as a system level control manager (SLCM). In some embodiments, a VHM 140′ may include a node specific DDR 45.

A Maintainer Layer 150 contains one or more computing node (150′) that analyze data received from the EHMs 120′, AHMs 130′ and VHM(s) 140′ and supports local field maintenance activities. Non-limiting examples of an Maintainer Level computing system is the Windows® PC ground based station (PC-GBS) software produced by Intelligent Automation Corporation a subsidiary of Honeywell International of Morristown, N.J.; or the US Army's Platform Soldier-Mission Readiness System (PS-MRS). The Maintainer Layer system may have different nomenclature favored by others. Nodes 150′ also receive data, commands and messages from higher level nodes 160′. In some embodiments, the maintainer node may include a node specific DDR 45.

An Enterprise Layer 160 contains one or more computing nodes (160′) that analyze data received from the EHMs 120′, AHMs 130′, VHM(s) 140′ and the Maintainer Layer 150. The Enterprise level supports the maintenance, logistics and operation of a multitude or fleet of assets. Non-limiting examples of an Enterprise Layer 160 computing system is the ZING™ system and the Predictive Trend Monitoring and Diagnostics System from Honeywell International. The Enterprise layer system 160′ may have different nomenclature favored by others and may include a DDR 45.

In accordance with the precepts of the subject matter disclosed herein, each computing node (120-160) of each level of the hierarchical structure 200 may be individually and timely configured or reconfigured by a user of the data driven modeling tool 171. Using the model driven GUI 170 (See FIG. 2), the data driven modeling tool 171 allows the user to directly alter the configuration data 180, which in turn provides specific direction and data to, and/or initiates, one or more standardized executable application modules (SEAM) (221-264) via the configuration file 185 resident in each computing node (120-160) of the hierarchical structure 200. In the following description the term “configure” and “provide specific direction and data” may be used synonymously.

In some embodiments, the data driven modeling tool 171 may be used to create a configuration file 185 for each of one or more computing nodes (120-160). This may be accomplished by editing the configuration file 185 of a node (120-160) to provide specific direction and data to each of an Allocate SEAM 241 and a Coordinate SEAM 252 that are populating a node thereby creating the DDR 45 for that node. The specific direction and data is found in the SDS 350 a and DDS 350 b of the configuration file 185 as authored by a system designer. Further, disclosure regarding how a configuration file, a SEAM (220-264) and a workflow service 310 work together to provide specific direction and data to a SEAM may be found in co-owned, co-pending application Ser. Nos. 13/077,276 and 13/273,984, which are incorporated herein in their entirety by reference.

The number of possible SEAMs (221-264) is not limited and may be expanded beyond the number discussed herein. Similarly, the SEAMs (221-264) discussed herein may be combined into fewer modules or broken down into component modules as may be required without departing from the scope of the disclosure herein. The SEAMs (221-264) comprise a set of services, run-time software, firmware and knowledge management tools that are selectable from one or more re-use libraries (220-260) and are subsequently directed to meet the health management implementation needs of a user. Each SEAM (221-264) contains executable code comprising a set of logic steps defining standardized subroutines designed to carry out a basic function that may be directed and redirected at a later time to carry out a specific functionality.

There are 24 exemplary SEAMs (221-264) discussed herein that are broken down into five non-limiting, exemplary libraries (220, 230, 240, 250 and 260). The standardized executable applications (221-264) are basic un-modifiable modular software objects that are directed to complete specific tasks via the configuration data 180 after the standardized executable software modules (221-264) are populated within the hierarchical structure 200. The configuration data 180 is implemented in conjunction with an executable application (221-264) via the delivery of a configuration file 185 containing the configuration data 180 to a node. Once configured, the SEAMs (221-264) within the node may then cooperatively perform a specific set of functions on data collected from the complex system. A non-limiting example of a specific set of functions may be a health monitoring algorithm.

As non-limiting examples, the Measure Library 220 may include an Acquire Module 221. The Acquire Module 221 functionality may provide a primary path for the input of data into a computing node (120-160) through a customized adapter 325 which embodies external callable interfaces. The customized adapter 325 pushes blocks of data into the Acquire Module 221, which then parses the data block and queues it for subsequent processing by another executable application (222-264).

The Measure Library 220 may include a Sense Module 223. The Sense Module 223 may provide a secondary path for the input of data into a computing node (120-160) through a system initiated request to read data from a physical I/O device (i.e. Serial data ports, Sensor I/O interfaces, etc.). The Sense Module 223, which then parses the data block and queues it for subsequent processing by another executable application (222-264).

The Measure Library 220 may include a Decode Module 222. The Decode Module 222 may take the data queued by the Acquire Module 221 or Sense Module 223 and translate the data into a useable form (i.e. symptoms and/or variables) that other executable applications can process. The Decode Module 222 may also fill a circular buffer with the data blocks queued by an Acquire Module 221 to enable snapshot or data logging functions.

The Extract Library 230 may include an Evaluate Module 231. The Evaluate Module 231 may perform a periodic assessment of state variables of the complex system to trigger data collection, set inhibit conditions and detect complex system events based on real-time or near real-time data.

The Extract Library 230 may include a Record Module 234. The Record Module 234 may evaluate decoded symptoms and variable to determine when snapshot/data logger functions are to be executed. If a snapshot/data log function has been triggered, the Record Module 234 may create specific snapshot/data logs and send them to a dynamic data store (DDS) 350 b. Snapshots may be triggered by another executable application (221-264) or by an external system (not shown).

The Extract Library 230 may include an Analyze Module 232. The Analyze Module 232 may run one or more algorithms using the variable values and trend data that may have been assembled by a Trend Module 233 and subsequently stored in a dynamic data store (DDS) 350 b to determine specific symptom states and/or provide estimates of unmeasured parameter values of interest. The DDS 350 b is a data storage location in a configuration file 185.

The Interpret Library 240 may include an Allocate Module 241. The Allocate Module 241 may perform inhibit processing, cascade effect removal and time delay processing on a set of symptoms detected by a node in the complex system and then allocate the symptoms to the appropriate fault condition(s) that is specified for the monitored device or subsystem. The Allocate Module 241 may also update the fault condition data based on changes in the value of any particular symptom associated with a fault condition and enforce consistency for mutually exclusive symptoms. A Diagnose Module 242, when properly configured by an associated configuration file 185, may be a component of a DDR 45 with in a node.

In various embodiments of the subject matter disclosed herein, diagnostic analysis of a component of a complex system is based on the notion of a “fault condition,” which is a reasoning construct used to collect evidence of a fault and manage the lifecycle of a detected fault. In practice, a fault condition is a set of static and dynamic data values that are used for diagnostic, prognostic or other analysis. Data values to conduct these analyses are stored in the SDS 350 a and the DDS 350 b. Diagnostic analysis is performed by a number of SEAMs within the Interpret library 240 as discussed further below. Interactions and coordination between the SEAMs is controlled by the Workflow Service 310 as is more fully describe in co-owned, co-pending application Ser. No. 13/077,276, which is incorporated herein in its entirety.

A function of the Allocate Module 241 is to determine if the symptoms are associated with a fault condition that has been detected previously, or if the evidence is indicative of the presence of a new fault. If the symptoms are not associated with any of the active Fault Conditions stored in a local dynamic data store (e.g., DDS 350 a, FIG. 2), a new fault condition is created and stored in a local static data store (e.g., SDS 350 b, FIG. 2) for future use.

When the Allocate Module 241 has completed processing the detected evidence of a fault condition, it asserts an event indicating that allocation processing has been completed. The Workflow Service 310 responds to the asserted event as described in co-owned, co-pending U.S. patent application Ser. No. 13/077,276 for example, which is incorporated herein by reference in its entirety. In some embodiments, the response to this event is to invoke the Rank module 243.

The Interpret Library 240 may include a Diagnose Module 242. The Diagnose Module 242 may orchestrate interaction between monitored assets and diagnostic reasoning to reduce the number of ambiguous failure modes for a given active fault condition until a maintenance procedure is identified that will resolve the root cause of the fault condition. A Diagnose Module 242, when properly configured by an associated configuration file 185, may be a component of a DDR 45 with in a node.

In some embodiments, a salient function of the Diagnose Module 242 is to manage the lifecycle of a fault condition. Based on configuration data stored in the SDS 350 a, and the content and attributes of the fault condition that are stored in the DDS 350 b, the Diagnose Module 242 calculates a value that identifies one of a set of defined states for a given fault condition. The value of the fault condition state is used by the Diagnose Module 242 and other SEAMs to control such things as user access, content of datasets, and allowable operations. Another aspect of fault condition lifecycle management is to detect Repeat Faults, detect and perform “Fault Condition Split” and “Fault Condition Merge” operations.

Some of the calculations performed by SEAMS from the Interpret Library 240 are based on a “single fault assumption”. The single fault assumption postulates that in the presence of multiple simultaneous symptoms associated with a common fault, it more likely that the symptoms are caused by a single fault rather than from multiple simultaneous faults. This assumption, however, can be problematic when simultaneous faults do occur. To mitigate the impact of the simplifying “single fault assumption” in multiple fault situations, the Fault Condition Split operation allows the Diagnose Module 242 to examine fault conditions that have not been isolated to a single failure mode (e.g., the failure mode ambiguity group size=1) and uses statistical methods to identify when it is more likely that there is more than one fault that is contributing to set of active Symptoms in the Fault Condition “symptoms of interest” list and creates a new Fault Condition. These statistical calculations may be found among the preprogrammed common utilities 306 and may be called by the Diagnose Module 242.

It is also possible that a lack of fidelity in the fault model can cause the Allocate Module 241 to create a new fault condition when, in fact, there is only one fault present in the monitored asset. In this case, the Diagnose Module 242 examines all the active fault conditions for the monitored fault and uses statistical methods to identify when it is more likely than not that there is a single fault that is contributing to set of active symptoms in multiple fault conditions and then merges the multiple (e.g. two) fault conditions into a single fault condition. When the Diagnose Module 242 has completed processing the fault conditions, it asserts an event indicating that Diagnose Module processing has been completed. The Workflow Service 310 may respond to the asserted event as described in co-owned, co-pending U.S. patent application Ser. No. 13/077,276. In some embodiments, the response to this event is to invoke a Report Module 253 when the Report module is installed and configured in the node (120-160).

The Interpret Library 240 may include a Rank Module 243. The Rank Module 243 may rank order the failure modes, related corrective actions (CA) and relevant test procedures associated with a particular active fault condition are ranked according to predefined criteria stored in a Static Data Store (SDS) 350 a. An SDS 350 a is a static data storage location in a configuration file 185. A Rank Module 243, when properly configured by the configuration file 185, may be a component of a DDR 45 within a node.

In some embodiments, a salient function of the Rank module 243 is to analyze the available evidence of a fault condition received at a node and produce data that will help isolate and correct the fault condition causing the evidence to be manifested. The specific algorithms performed by the Rank module 243 may be reconfigurable, based on the information desired and the fault model architecture.

In at least one embodiment, the Rank Module 243, can be configured to calculate the likelihood that certain failure modes may be present, the likelihood that certain repairs may be needed to correct the fault, the benefit of collecting additional symptoms, and the cost/benefit ratio of performing additional tests to isolate the failure mode or repair. In other embodiments, the Rank Module 243, can be configured to produce a data summary that represents partial diagnostic conclusions determined at a node (120-160) for use by higher layer computing nodes. When invoked by the Workflow Service 310, the Rank Module 243 may examine each fault condition that contains updated or new symptoms in a “Symptoms of Interest” list. For each of the qualifying fault conditions in the list, statistical analysis methods available in the common utilities or elsewhere in the framework services may be used to calculate the following:

-   -   1. For each failure mode associated with the symptoms, the         likelihood that the failure mode is present.     -   2. For each corrective action associated with the failure modes,         the likelihood that the Corrective Action will fix the fault.     -   3. For each symptom associated with a failure mode, the average         gross and net cost benefit realized if the symptom state was         known.     -   4. For each diagnostic test associated with the Symptoms that         are produced by the test, the average gross and net cost benefit         of the performing the test.

When the Rank module has completed processing the evidence, it asserts an event indicating that allocation processing has been completed. The Workflow Service 310 may respond to the asserted event as described in co-owned, co-pending U.S. patent application Ser. No. 13/077,276, for example. In some embodiments, the response to this event is to invoke the Diagnose Module 242.

The Interpret Library 240 may include a Predict Module 244. The Predict Module 244 may run prognostic algorithms on trending data stored in the DDS 350 b in order to determine potential future failures that may occur and provide a predictive time estimate.

The Interpret Library 240 may include a Consumption Monitoring Module 245. The Consumption Monitoring Module 245 may monitor consumption indicators and/or may run prognostic algorithms on trending data stored in the DDS 350 b that are configured to track the consumption of perishable/life-limited supply material in the complex system and then predict when resupply will be needed. The consumption monitoring functionality may be invoked by a workflow service module 310, which is a component functionality of an internal callable interface 300 and will be discussed further below.

The Interpret Library 240 may include a Usage Monitoring Module 246. The Usage Monitoring Module 246 may monitor trend data stored in the DDS 350 b to track the usage of a monitored device or subsystem in order to estimate the need for preventative maintenance and other maintenance operations. The usage monitoring functionality may be invoked by the workflow service 310, which is a component functionality of the internal callable interface 300.

The Interpret Library 240 may include a Summarize Module 247. The Summarize Module 247 may fuse health data received from all subsystems monitored by an application layer and its subordinate layers (120-160) into a hierarchical set of asset status reports. Such reports may indicate physical or functional availability for use. The asset status reports may be displayed in a series of graphics or data trees on the GUI 170 that summarizes the hierarchical nature of the data in a manner that allows the user to drill down into the CBM layer by layer for more detail. The Summarize functionality may be invoked by the Workflow service 310. This invocation may be triggered in response to an event that indicates that a diagnostic conclusion has been updated by another module of the plurality. The display of the asset status may be invoked by the user through the user interface.

The Act Library 250 may include a Schedule Module 251. The Schedule Module 251 schedules the optimal time in which required or recommended maintenance actions (MA) should be performed in accordance with predefined criteria. Data used to evaluate the timing include specified priorities and the availability of required assets such as maintenance personnel, parts, tools, specialized maintenance equipment and the device/subsystem itself. Schedule functionality may be invoked by the workflow service 310.

The Act Library 250 may include a Coordinate Module 252. The Coordinate Module 252 coordinates the execution of actions and the reporting of the results of those actions between application layers 120-160 and between layers and their monitored devices/subsystems. Exemplary, non-limiting actions include initiating a BIT or a snapshot function. Actions may be pushed into and results may be pulled out of the Coordinate Module 252 using a customized adapter 325 a-e which embodies an external callable interface. The customized adapter 325 a-e may be symmetric such that the same communications protocol may be used when communicating up the hierarchy as when communicating down the hierarchy.

The Act Library 250 may include a Report Module 253. The Report Module 253 may generate a specified data block to be sent to the next higher application in the hierarchy and/or to an external user. Report data may be pulled from the Report Module 253 by the customized adapter 325 a-e. The Report Module 253 may generate data that includes a health status summary of the monitored asset.

The Act Library 250 may include a Track Module 254. The Track Module 254 may interact with the user to display actions for which the user is assigned and to allow work to be accomplished or reassigned.

The Act Library 250 may include a Forecast Module 255. The Forecast Module 255 may determine the need for materials, labor, facilities and other resources in order to support the optimization of logistic services. Forecast functionality may be invoked by the Workflow service 310.

The Act Library 250 may include a Log Module 256. The Log Module 256 may maintain journals of selected data items and how the data items had been determined over a selected time period. Logging may be performed for any desired data item. Non-limiting examples include maintenance actions, reported faults, events and the like.

The Interact Library 260 may include a Render Module 262. The Render Module 262 may construct reports, tabularized data, structured data and HTML pages for display, export or delivery to the user.

The Interact Library 260 may include a Respond Module 261. The Respond Module 261 may render data for display to the user describing the overall health of the complex system and to support detailed views to allow “drill down” for display of summary evidence, recommended actions and dialogs. The rendering of display data may be initiated by the Workflow service 310; but the data may be pulled from the Render Module 262 via the callable interface 300. The Respond Module 261 may also receive and process commands from the user then route the commands to the appropriate module in the appropriate node for execution and processing. The commands may be pushed into the Respond Module via the callable interface 300.

The Interact Library 260 may include a Graph Module 263. The Graph Module 263 may provide graphical data for use by the Render Module 262 in the user displays on GUI 170. The graphical data may include the static content of snapshot and trend files or may dynamically update the content of the data in the circular buffer.

The Interact Library 260 may include an Invoke Module 264. The Invoke Module 264 may retrieve documents to be displayed to a maintainer or interacts with an external document server system (not shown) to cause externally managed documents to be imported and displayed.

To reiterate, each of the SEAMs (221-264) discussed above are never modified. The SEAMs (221-264) are loaded into any computing node (120-160) of the hierarchical system 200 and any number of SEAMs may be loaded into a single node. Once installed, each standard executable application module (221-264) may be initialized, directed and redirected by a user by changing the configuration data 180 resident in the database 190 to perform specific tasks in regard to its host computing device or platform, via its associated configuration file 185.

Communication between SEAMs (221-264) within a node is facilitated by a callable interface 300. A callable interface 300 is resident in each computing node (120-160) of the hierarchical structure 200. The callable interface 300 may have several sub-modules (302-310) that may be co-resident in a single computing device of a computing node (120-160). Exemplary sub-modules of the callable interface 300 may include a framework executive module 301 as a component of the callable interface 300, a workflow service 310, an error reporting server 302, a debugging server 303, a framework data accessor, a run-time shared data manager 305 and common utilities 306. Those of ordinary skill in the art will recognize that in equivalent embodiments a “module,” “a sub-module,” “a server,” or “a service” may comprise software, hardware, firmware or a combination thereof.

The framework executive module 301 of a computing node provides functions that integrate the nodes within the hierarchical system 200. The framework executive module 301 in conjunction with the configuration files 185 coordinate initialization of each node including the SEAMs (221-264) and the other service modules 301-310 allowing the execution of functions that are not triggered by the customized adapter 325. In some embodiments, the computing nodes in all application layers may have a framework executive module 301. In other embodiments, nodes in most application layers except, for example, an EHM Layer 120 will have a framework executive module 301. In such embodiments, the computing nodes in the EHM layer 120 may rely on its host platform (i.e. computing device) operating software to perform the functions of the framework executive.

Error reporting services 302 provide functions for reporting run-time errors in a node (120-160) within the hierarchical structure 200. The error reporting server 302 converts application errors into symptoms that are then processed as any other failure symptom, reports application errors to a debugging server 303 and reports application errors to a persistent data manager (not shown).

Debugging services 303 collects and reports debugging status of an executable application module (221-264) during testing, integration, certification, or advanced maintenance services. This server may allow the user to set values for variables in the DDS 350 b and to assert workflow events.

The framework data accessor 304 provides read access to the SDS 350 a and read/write access to the DDS 350 b (each stored in a memory 190) by the SEAMs (221-264) in a computing node (120-160). Write access to the SDS 350 a is accomplished via the data driven modeling tool 171, which includes GUI 170.

The run-time shared data manager 305 manages all node in-memory run-time perishable data structures that are shared between SEAMs (221-264) that are not stored in the DDS 350 b, but does not include cached static data. As non-limiting examples of perishable data structures may include I/O queues and circular buffers.

Common utilities 306 may include common message encoding/decoding, time-stamping and expression evaluation functions for use by the SEAMs (221-264) installed in a computing node.

The work flow service 310 is a standardized set of logic instructions that enable a data-driven flow of tasks within a computing node to be executed by the various SEAMs (221-264) within the node. The workflow service 310 acts as a communication control point within the computing node where all communications related to program execution to or from one executable application module (221-264) are directed through the node's workflow service 310. Stated differently, the workflow service 310 of a node (120′-160′) orchestrates the work flow sequence among the various SEAMs (221-264) that happen to reside in the node. In some embodiments the workflow service 310 may be a state machine.

FIG. 3 is a simplified, exemplary schematic of a configured hierarchical structure 200 that may optimize the run time performance of the hierarchical structure 200. The exemplary embodiment of FIG. 3 features a hierarchical structure 200 comprising five exemplary hierarchical layers (120-160), although in other embodiments the number of hierarchical layers may range from a single layer to any number of layers. Each hierarchical layer (120-160) includes one or more nodes (120′-160′) containing SEAMs (221-264) that were copied and loaded from one of the reusable libraries (220-260) into a computing node (120′-160′) in the layer. Each SEAM (221-264) may be configured by a user 210 by modifying its respective loadable configuration file 185. The loadable configuration file 185 is constructed using the data driven modeling tool 171.

For the sake of simplicity, the SEAMs (221-264) will be discussed below in terms of their respective libraries. The number of combinations and permutations of executable applications (221-264) is large and renders a discussion using specific SEAMs unnecessarily cumbersome.

At an EHM layer 120, there may be a number of EHM nodes 120′, each being operated by a particular host computing device that is coupled to one or more sensors and/or actuators (not shown) of a particular component of the complex system. As a non-limiting example, the component of the complex system may be a roller bearing that is monitored by a temperature sensor, a vibration sensor, a built-in-test, sensor and a tachometer, each sensor being communicatively coupled to the computing device (i.e. a node). As a non-limiting example, the host computing device of an EHM 120′ of the complex system may be a computer driven component area manager (“CAM”) (i.e. a node). For a non-limiting example of a CAM that may be suitable for use as EHM nodes, see co-owned, co-pending U.S. patent application Ser. No. 12/493,750 to Goldstein.

Each EHM (120′) host computing device in this example is operated by a host software application 330. The host software application 330 may be a proprietary program, a custom designed program or an off-the-shelf program. In addition to operating the host device, the host software application also may support any and all of the SEAMs (221-264) via the framework services 310 by acting as a communication interface means between EHMs 120′ and between EHMs 120′ and other nodes located in the higher levels.

The exemplary embodiment of FIG. 3 illustrates that the host software application 330 of an EHM 120′ may host (i.e. cooperate) one or more SEAMs 220 e from the Measure Library 220, one or more SEAMs 230 e from the Extract Library 230 and one or more SEAMs 250 e from the Act Library 250. The SEAMs 220 e, 230 e, and 250 e are identical to their counterpart application modules that may reside in any another node in any other level in the hierarchical structure 200. Only when directed by the configuration file 185 e, will a SEAM(s) (221-264) differ in performance from its counterpart module that has been configured for and is a resident in another node in the hierarchical structure 200. Once configured/directed, a standardized executable application (221-264) becomes a special purpose executable application module.

At an AHM level 130, there may be a number of AHM nodes 130′. Each AHM node is associated with a particular host computing device that may be coupled to one or more sensors and/or actuators of a particular component(s) or a subsystem of the complex system and are in operable communication with other AHM nodes 130′, with various EHM nodes 120′ and with higher level nodes (e.g., see 501, 502, 601 and 602 in FIGS. 5-6). As a non-limiting example, the host computing device of an AHM of the complex system may be a computer driven sub-system area manager (“SAM”) (i.e. a node) operating under its own operating system (not shown). For non-limiting examples of a SAM that may be suitable for use as an AHM node, see co-owned, co-pending patent application Ser. No. 12/493,750 to Goldstein.

The exemplary AHM node 130′ of FIG. 3 illustrates that the AHM 130′ has an additional interpret functionality 240 d that in this example has not been configured into the EHM 120′. This is not to say that the EHM 120′ cannot accept or execute a function from the Interpret library 240, but that the system user 210 has chosen not to populate the EHM node 120′ with that general functionality. On the other hand, the AHM node 130′ software hosts one or more SEAMs 220 d from the Measure Library 220, one or more SEAMs 230 d from the Extract Library 230 and one or more SEAMs 250 d from the Act Library 250. In their unconfigured or undirected state, the SEAMs 220 d, 230 d, and 250 d are identical to their counterpart application modules that may reside in any another node in any other level in the hierarchical structure 200.

Unlike the exemplary EHM node 120′, the exemplary AHM node 130′ may include a different communication interface means such as the customized adapter 325 d. A customized adapter 325 is a set of services, run-time software, hardware and software tools that are not associated with any of the SEAMs (221-264). The customized adapters 325 are configured to bridge any communication or implementation gap between the hierarchical CBM system software and the computing device operating software, such as the host application software (not shown). Each computing node (120′-160′) may be operated by its own operating system, which is its host application software. For the sake of clarity, FIG. 3 shows only the host application software 330 for the EHM 120′. However, host application software exists in all computing nodes (120′-160′).

In particular the customized adapters 325 provide symmetric communication interfaces (e.g., communication protocols) between computing nodes and between computing nodes of different levels. The customized adapter 325 a-d allow for the use of a common communication protocol throughout the hierarchical structure 200 from the lowest EHM layer 120 to the highest enterprise layer 160 as well as with the memory 190.

At a VHM layer 140, there may be a number of VHM nodes 140′, each VHM node is associated with a particular host computing device that may be in operative communication with one or more sensors and/or actuators of a particular component(s) of the complex system via an EHM 120′ or to subsystems of the complex system and that are in operable communication via their respective AHMs 130′. As a non-limiting example, the VHM 140′ may be a computer driven system level control manager (“SLCM”) (i.e. also a node). For non-limiting examples of a SLCM that may be suitable for use as a VHM node, see co-owned, co-pending patent application Ser. No. 12/493,750 to Goldstein.

In the exemplary hierarchical structure 200 there may be only one VHM 140′, which may be associated with any number of AHM 130′ and EHM 120′ nodes monitoring a sub-systems of the complex system. In other embodiments, there may more than one VHM 140′ resident within the complex system. As a non-limiting example, the complex system may be a fleet of trucks with one VHM 140′ in each truck that communicates with several EHMs 120′ and with several AHMs 130′ in each truck. Each group of EHMs 120′ and AHMs 130′ in a truck may also be disposed in a hierarchical structure 200

FIG. 3 further illustrates that the exemplary VHM 140′ has an additional Interact functionality 260 c that has not been loaded into the EHM 120′ or into the AHM 130′. This is not to say that these lower level nodes cannot accept or execute an Interact function 260, but that the system user 210 has chosen not to populate the lower level nodes with that functionality. On the other hand, for example, the host software of VHM 140′ hosts one or more SEAMs 220 c from the Measure Library 220, one or more SEAMs 230 c from the Extract Library 230, one or more SEAMs 240 c from the Interpret Library 240 and one or more SEAMs 250 c from the Act Library 250. The executable applications from the Interact library allow the system user 210 to access the VHM 140′ directly and to view the direction thereof via the GUI 170. In their undirected state, the SEAMs 220 c, 230 c, 240 c and 250 c are identical to their counterpart application modules that may reside in any another node in any other level in the hierarchical structure 200. The standardized executable applications 220 c-260 c are directed to carry out specific functions via configuration files 185 c.

At the Maintainer (MNT) layer 150, there may be a number of MNT nodes 150′, like the exemplary AHM node 130′, an exemplary MNT node 150′ includes a customized adapter 325 c. The customized adapter 325 c is also configured to bridge any communication or implementation gap between the hierarchical system software and the computing device operating software operating within VHM 140′. Each MNT node is associated with a particular host computing device that may be in operative communication with one or more sensors and/or actuators of a particular component(s) of the complex system via an EHM 120′, to subsystems of the complex system and that are in operable communication via their respective AHM 130′, and to the VHMs 140′. As a non-limiting example, the MNT node 150′ may be a laptop computer in wired or wireless communication with the communication system 9 of the hierarchical structure 200.

FIG. 3 illustrates that the exemplary MNT node 150′ may have the functionality of some or all of the executable applications (221-264). This is not to say that these lower level nodes cannot accept or execute any of the executable applications (221-264), but that the system user 210 has chosen not to populate the lower level nodes with that functionality. Like the exemplary VHM 140′ the executable application(s) 260 b from the Interact library allow the system user 210 to access the Maintainer node 150′ directly and may view the direction thereof via the GUI 170. In their undirected state, the SEAMs 220 b, 230 b, 240 b and 250 b are identical to their standard counterpart application modules that may reside in any another node in any other level in the hierarchical CBM structure 200. The executable applications 220 b-260 b are directed to carry out specific functions via configuration files 185 b.

Like the exemplary AHM node 130′ and VHM node 140′, the MNT node 150′ includes a customized adapter 325 b. The customized adapter is also configured to bridge any communication implementation gap between the hierarchical system software and the computing device operating software operating within the various nodes of the hierarchical structure 200.

At the Enterprise (ENT) layer 160, there may be a number of ENT nodes 160′, each ENT node is associated with a particular host computing device that may be in operative communication with one or more sensors and/or actuators of a particular component(s) of the complex system via an EHM 120′, to subsystems of the complex system and that are in operable communication via their respective AHM modules 130′ and the VHMs 140′, as well the MNT nodes 150′. As a non-limiting example, the ENT node 160′ may be a general purpose computer that is in wired or wireless communication with the communication system 9 of the hierarchical structure 200.

FIG. 3 also illustrates that the ENT 160′ may have the functionality of some or all of the executable applications (221-264) as selected and configured by the user. Like the exemplary VHM node 140′, the executable application(s) 260 a from the Interact library allow the system user 210 to access the ENT 160′ node directly via the GUI 170. In their undirected state, the SEAMs 220 a, 230 a, 240 a and 250 a are identical to their undirected counterpart application modules (221-264) that may reside in any another node in any other level in the hierarchical structure 200. The executable applications 220 a-260 a are configured/directed to carry out specific functions via configuration files 185 a.

Like the exemplary AHM node 130′, VHM node 140′ and the MNT node 150′, the ENT node 160′ includes a customized adapter 325 a. The customized adapter 325 a is also configured to bridge any communication or implementation gap between the hierarchical system software and the host computing device software operating within the ENT node.

In various embodiments, none of the computing nodes (120′-160′) are able to communicate directly with one another. Hence, all computing nodes (120′-160′) communicate via the customized adapters (325). In other embodiments, most computing nodes 120′-160′ may communicate via the customized adapters (325). For example, an exception may be an EHM 120′, which may communicate via its host executive software 330.

Like the executable applications (221-264), the operation of each of the customized adapters 325 is controlled by the workflow service 310 of its own node. The workflow service 310 will invoke one or more of the SEAMs (221-264) and services (302, 303, 306) to make data available to the customized adapter 325, which provides data from a node onto a data bus of the communication system 9 and pulls data from the bus at the direction of one of the executable applications (221-264). For example, the Acquire SEAM 221 or the Report Module 253 executes these communication functions

The communication system 9 may be any suitable wired or wireless communications means known in the art or that may be developed in the future. Exemplary, non-limiting communications means includes a CANbus, an Ethernet bus, a firewire bus, spacewire bus, an intranet, the Internet, a cellular telephone network, a packet switched telephone network, and the like.

The use of a universal input/output front end interface (not shown) may be included in each computing node (120′-160′) as a customized adapter 325 or in addition to a customized adapter 325. The use of a universal input/output (I/O) front end interface makes each node behind the interface agnostic to the communications system by which it is communicating. Examples of universal I/O interfaces may be found in co-owned application Ser. Nos. 12/750,341 and 12/768,448 to Fletcher and are examples of communication interface means.

The various computing nodes (120′-160′) of the hierarchical structure 200 may be populated using a number of methods known in the art, the discussion of which is outside the scope of this disclosure. However, exemplary methods include transferring and installing the pre-identified, pre-selected standardized executable applications to one or more data loaders of the complex system via a disk or other memory device such as a flash drive. Other methods include downloading and installing the executable applications directly from a remote computer over a wired or wireless network using the complex system model 181, the table generator 183 and the GUI 170.

The data driven modeling tool 171, table generator 183 and the GUI 170 may be driven by, or be a subsystem of any suitable HMS computer system known in the art. A non-limiting example of such an HMS system is the Knowledge Maintenance System used by Honeywell International of Morristown N.J. and is a non-limiting example of a model based configuration means. The data driven modeling tool 171 allows a subject matter expert to model their hierarchical system 200 as to inputs, outputs, interfaces, errors, etc. The table generator 283 then condenses the system model information into a compact dataset that at runtime configures or directs the functionality of the various SEAMs (221-264) of hierarchical system 200.

The GUI 170 renders a number of control screens to a user. The control screens are generated by the HMS system and provide an interface for the system user 210 to configure each SEAM (221-264) to perform specific monitoring, interpretation and reporting functions associated with the complex system.

FIG. 4 is a flow diagram of an exemplary method 1000 for configuring/reconfiguring a hierarchical structure 200 comprising computing nodes (120-160) that are monitoring various components of the complex system. There may be any number and any combination of different types of levels of computing nodes.

The method begins by establishing a hierarchical structure 200 of computing nodes at process 1010. The hierarchical structure 200 of computing nodes is determined by the nature and construction of the complex system of concern, as well as the complexity of monitoring of the complex system that is required. As discussed above, in some embodiments there may be one or more computing nodes (120′-160′) associated with each component, with each sub-system and/or with the overall complex system. In addition, there may be a computing node (120′-160′) associated with a higher maintainer layer (150), as well as with a general enterprise layer (160). One computing node (120′-160′) may be physically and electronically different from another computing node on the same layer (120-160) or on a different level. In other embodiments, a computing node may be identical to all other computing nodes. FIG. 4 is an exemplary screen shot of GUI 170 (See FIG. 2) that allows a user to establish parent and child nodal relationships according to the complex system model.

At process 1040, a standardized framework executive module 301 is created and defined with the desired framework services (302-310). The standardized framework service module 301 is populated to all of the hierarchical computing nodes (120′-160′).

At process 1020, the libraries 220-260 of standardized executable applications are developed and established. As discussed above, each standardized executable function (221-264) is written to perform a standard class of functionality such as acquiring data, trending data and reporting data.

At process 1050, a system user 210 populates each computing node (120′-160′) with one or more of the standardized executable applications (221-264) and the standardized framework executive module 301. The number and combination of standardized executable applications populated within in a particular computing node (120′-160′) is entirely within the discretion of the system designer based on the functionality or potential functionality desired. A standardized executable application (221-264) may be populated or removed from a computing node (120′-160′) by any suitable means known in the art. Non-limiting examples of some means for populating a computing node (120-160) includes a maintenance load, a local data loader and loading via a network and communication system 9.

At process 1030, the complex system is modeled on the data driven modeling tool 171. Each computing node (120′-160′) is identified and associated with a particular component, sub-component and subsystem as may be desired to accomplish a particular level of monitoring. Each computing node (120′-160′) is assigned a particular set of SEAMS (221-264) that will be required to accomplish the desired monitoring functionality of the computing node (see, FIG. 4). At this point an Allocate SEAM, a Diagnose SEAM and a Rank SEAM may be assigned to each computing node (120′-160′) from which to form a DDR45 for each node along with the configuration file 185.

At process 1060, a plurality of configuration files 185 are created by a system user 210. A configuration file 185 comprises a static data portion (SDS) 350 a and a dynamic data portion (DDS) 350 b. Configuration files 185 contain a collection of editable data specific logic sequences that generate messages and data that are used by the workflow service 310 to respond to the receipt of data and messages from a SEAM to perform a specific function. For example, a SEAM X communicates to the workflow service 310 that it has completed a task. The workflow service 310 retrieves the next action from the configuration file and then commands the next SEAM Y to execute its standardized function with specific data. In other words, a configuration file contains specific data values and programming relationships/functions between data values to enable/disable and to configure each standard executable application to accomplish a special purpose(s). In equivalent embodiments, the editable data specific logic sequences contained in a configuration file may be a collection of state machines.

Thus, the configuration files 185 provide the information that allows the SEAMs to operate and to interact with each other. Specifically this interaction is controlled via the workflow service which obtains all of its directives from the configuration files 185 to enable or disable functionality of the SEAMs as well as provide processing of data within the node (120-160). The same SEAMs may be used in all nodes because the configuration files 185 and the workflow service 310 direct the execution of the SEAMs within a node and provides the ability to move functionality between nodes.

Configuration files 185 contain the definition of each node (120′-160′). This includes the information that a given node will process, how the node interacts with other nodes and special operations that are run within a given node. The configuration files 185 contain the information to process data, generate signals, diagnose failures, predict failures, monitor usage, monitor consumption and otherwise support maintenance, operation and data analysis. Thus, the DDR 45 is created by editing the configuration file to include the portion of the complex system fault module that is applicable to the components of the complex system that that reside within in the monitoring purview of a particular node (120-160). Hence, a computing node (120′-160′) populated with standardized executable applications (221-264) becomes a special purpose computing node capable of performing a variety of specific tasks based on its population of executable applications and their subsequent direction by configuration files 185.

Should a system user 210 desire to add specific functions, delete specific functions or redefine specific functions for a particular computing node (120′-160′) in the hierarchical structure 200, the configuration file 185 for a particular executable application (221-264) in a particular computing node (120′-160′) is modified within the KMS master database 350 as may be desired at process 1060 and then regenerated and installed at its associated computing node (120′-160′) at process 1070. Thus, specific functionality formerly resident in one computing node (120′-160′) may be added, deleted, modified or it may be moved to another computing node in any other hierarchical level.

For example, data “Trending” functionality being accomplished by an EHM 120′ associated with the temperature of a particular component may be shifted from the EHM 120′ to the VHM 140′ by adding the standardized “Trending” executable application to the VHM 140′ (or by enabling a dormant “Trending” functionality already in place) and then configuring the “Trending” executable application in the VHM to perform the operation. To complete the process, the Trending functionality in the EHM 120′ may be changed to remove the temperature trending functionality or to disable the Trending executable application. Further, the temperature data from the component is redirected to the VHM 140′ via the communication system 9. As such, the data being trended at the EHM 120′ may be still acquired and analyzed at the EHM 120′ but then sent from the EHM to the VHM 140′ for trending.

FIGS. 5 and 6 are simplified block diagrams of an exemplary computing node (120′-160′), which here happens to be an EHM 120′. Each computing node (120′-160′) utilizes its own host executive software 330. The host executive software 330 executes the normal operating functions of the host EHM 120′, but may also provide a platform for hosting the health maintenance functions residing in any SEAM (221-264) populating the computing node.

As described above, there are 24 SEAMs (221-264) disclosed herein. However, other SEAMs with additional functionalities may be included. As such, any discussion herein is intended to extend to any SEAMs that may be created in the future. However, in the interest of brevity and clarity of the following discussion, the number of SEAMs (221-264) has been limited to an Acquire SEAM 221, a Decode SEAM 222, Evaluate SEAM 231, a Record SEAM 234 and an Analyze SEAM 232 as these SEAMs may be viewed as providing some basic functionality common to each SEAM resident in each computing node (120′-160′) of the hierarchy.

In addition to the SEAMs (221-264), each computing node (120′-160′) also includes a configuration file 185 and a workflow service module 310. The configuration file 185 comprises the DDS 350 b and the SDS 350 a. Among other data structures, the DDS 350 b may comprise an Event Queue (EVQ) 351, a High Priority Queue (HPQ) 352, a Time Delayed Queue (TDQ) 353, a Periodic Queue (PQ) 354 and an Asynchronous Queue (PQ) 355. However, it will be appreciated by those of ordinary skill in the art that the number of queues, their categorization and their priority may be defined and redefined to meet the requirements of a particular application. The EVQ 351 may be subdivided into additional sub-queues (See Application 002.1404).

Referring to FIG. 6, the DDS 350 b may also include at least one message buffer 360 for each SEAM (221-264) that has been populated into the EHM 120′. However, in some embodiments only SEAMs within the Measure Library may have a message buffer. The DDS 350 b may also include a number of record snapshot buffers 370 and circular buffers 380 that store particular dynamic data values obtained from the complex system to be used by the various SEAMs (221-264) for various computations as provided for by the configuration file 185. The data stored in each of the message buffers 360, snapshot buffers 370 and circular buffers 380 is accessed using a data accessor (304) which may be any suitable data accessor software object known in the art. The particular data structure and the location in the DDS 350 b for the message buffers 160, circular buffers 380 and snapshot buffers 370, are predetermined and are established in a memory device at run time.

The SDS 350 a is a persistent software object that may be manifested or defined as one or more state machines 361 that map a particular event 362 being read by the workflow service module 310 from the Event Queue (EVQ) 351 to a particular response record 363 (i.e., an event/response relationship). The state machine 361 then assigns a response queue (352-355) into which the response record 363 is to be placed by the workflow service module 310 for eventual reading and execution by the workflow service module 310. The structure and the location of the persistent data in the SDS 350 a is predetermined and is established in a memory device at run time.

Events 362 may be received into the EVQ 351 in response to a message from an outside source that is handled by the customized adapter 325 of the computing node (120′-160′), as directed by the host executive software 330. Events 362 may also be received from any of the populated SEAMs (221-264) resident in the computing node (120′-160′) as they complete a task and produce an event 362.

In the more basic SEAMs, such as Sense 223, Acquire 221, Decode 222 and Evaluate 231, the event/response relationships stored within the SDS 350 a do not tend to branch or otherwise contain significant conditional logic. As such, the flow of events 362 and response records 363 is relatively straight forward. However, more sophisticated SEAMs such as Coordinate 252, Forecast 255 and Respond 261 may utilize sophisticated algorithms that lead to complicated message/response flows and will not be discussed further herein the interest of brevity and clarity.

As an operational example, the host executive software 330 may push an input message into an EHM 120′ that is received from an outside source. The host executive software 330 calls a customized adapter 325 which in turn calls the appropriate SEAM (221-264) resident in the EHM 120′ based on data included in the message. For Example, the called SEAM may be the Acquire SEAM 221. When called, the Acquire SEAM 221 places the input message into a message buffer 360 (e.g., the Acquire input message buffer), generates an event 362 and places the event into the EVQ 351. The event 362 may contain data about the complex system from another node or from a local sensor. In the interest of simplicity and clarity of explanation, this first event 362 will be assumed to be an “acquire data” message and the event 362 generated from the input message will be referred to herein as AQe₁. In other embodiments the input message AQ₁ may be generated by a SEAM (221-264) and the event AQ_(e1) pushed into the EVQ 351 by the SEAM.

Once the input message AQ₁ is placed in a message queue 360 and its corresponding event 362 is placed into the EVQ 351, then the Acquire SEAM 221 exits and returns control to the workflow service module 310 via return message 364. In this simple example, only a single processor processing a single command thread is assumed. Thus, while the processor is executing a particular SEAM (221-264), the workflow service module 310 and no other SEAMs are operating. Similarly, while the workflow service module 310 is being operated by the processor, no SEAMS (221-264) are in operation. This is because all steps in the operation are performed sequentially. However, in other embodiments, multiple processors may be used, thereby permitting multiple threads (i.e., multiple workflow service modules 310) to be operated in parallel using the same populated set of SEAMs (221-264) and the same configuration file 185.

Upon receiving the return 364 (See, FIG. 7), the workflow service module 310 resumes operation and reads event AQ_(e1) first in this example because event AQ_(e1) is the first event 362 in the EVQ 351. This is so because the EVQ 351 is the highest priority queue and because the workflow service module 310 may read events sequentially in a first-in-first-out (FIFO) manner. Therefore those of ordinary skill in the art will appreciate that any subsequent events stored in the EVQ 351 would be read in turn by the workflow server on FIFO basis. However, reading events in a FIFO manner is merely exemplary. In equivalent embodiments, the workflow service module may be configured to read events in some other ordinal or prioritized manner.

Once event AQ_(e1) is read, the workflow service module 310 consults the persistent data structures in the SDS 350 a to determine the required response record 363 to the event AQ_(e1). The response record 363 provided by the SDS 350 a may, for example, be a decode response record DEC_(r1) that directs the Decode SEAM 222 to process the data received from the input message AQ₁, which is now stored in a storage location in the DDS 350 b. The SDS 350 a also directs the workflow service module 310 to place the response record DEC_(r1) into one of the response queues 352-355, such as HPQ 352, and assigns the location in the response queue in which to place the response based on an assigned priority. The SDS 350 a may determine the appropriate queue and its priority location in the queue based on the input message type, the data in the input message and on other data such as a priority data field. The workflow service module 310 places the response record DEC_(r1) into the HPQ 352 at the proper prioritized location and returns to read the next event in the EVQ 351.

Because the EVQ 351 is the highest priority event/response queue in this example, the workflow service module 310 continues reading events 362 and posts responses records 363 until the EVQ is empty. When the EVQ 351 is empty, the workflow service module 310 begins working on response records 363 beginning with the highest priority response queue (352-355), which in this example is the HPQ 352.

The first prioritized response record in HPQ 352 in this example is the DEC_(r1) response (i.e., a Decode response). When read, the workflow service module 310 calls (via call 365) a response handler interface of the decode SEAM 222 for the Decode SEAM to operate on the data referenced in the DEC_(r1) response record 363.

After being called by the workflow service module 310, the Decode SEAM 222 consults the SDS 350 a with the response record DEC_(r1) to determine what operation it should perform on the data associated with DEC_(r1) and performs it. As disclosed above, a SDS 350 a maps the event DEC_(r1) to a predefined response record 363 based on the message type and the data referenced within DEC_(r1). Data associated with event DEC_(r1) may reside in any of the record snapshot buffers 370, circular buffers 380, or the data may have to be queried for from a source located outside the exemplary EHM 120′.

The Decode SEAM 222 operates on the data and generates an event 362 and places the event into the EVQ 351 and a message into the message queue 360. For example, the response record 363 generated by the Decode SEAM 222 may be EVAL_(e1) indicating that the next process is to be performed by the Evaluate SEAM 231. The Decode SEAM 222 then exits and sends a return message 364 back to the workflow service module 310 to resume its operation. The process begins anew with the workflow service module 310 reading the EVQ 351 because there are now new events (including EVAL_(e1)) that have been added to the queue.

In the normal course, the work flow service module 310 eventually reads event EVAL_(e1) and consults the SDS 350 a to determine the proper response record 363 and which response queue to place it and in what priority within the response queue. In this example the response EVAL_(r1) is also place in the HPQ 352 and is in first priority because the response record DEC_(r1) would have already been operated on and dropped out of the queue. The workflow service then reads the next event from the EVQ 351, and the process continues

FIG. 7 is a simplified flow chart of a method 1300 for coordinating the operation of various SEAMs (221-264) within a computing node (120′-170′). However, those of ordinary skill in the art will appreciate that the use of multiple processors will allow for multiple threads to be processed in parallel.

At process 1310, an event 362 is pushed into the system by the customized adapter 325 or, in the case of some EHMs 120′ by the host executive software 330. In some embodiments, the host executive 330 may make a function call 1311 to a SEAM (221-264) to accept the event message such as the Acquire SEAM 221. At process 1330, the event record 362 is placed into the EVQ 351 by the called Seam (221-264) in the order in which it was received and the input message is stored in a queue or a message buffer 360 resident in the DDS 350 b by the SEAM (221-264). The SEAM (221-264) then sends a return command 1312 to the customized adapter 325 and exits.

It is assumed in this simple example, the workflow service module 310 had no other events or response records to process. Therefore the workflow service module 310 may restart at process 1340, although it may restart at any point in its routine. At process 1340, the workflow service module 310 attempts to read the next event record in FIFO order from the EVQ 351. If it is determined that the EVQ 351 is not empty at decision point 1341, then the workflow service module 310 reads the next event 362 from the EVQ and then consults the persistent data (e.g., a state machine) in the SDS 350 a with the event 362.

At process 1360, the SDS 350 a receives the event 362 as an input and produces a predefined response record 363. The SDS 350 a also indicates the response queue (352-355) into which the response record 363 is to be placed, and indicates a priority location for the response record in the response queue as. Any data associated with an event/response record is stored in a shared data structure in the DDS 350 b, such as in a circular buffer 380 or in a record snapshot buffer 370.

At process 1370, the workflow service module 310 stores the response record 363 into the assigned response queue (352-355) in its priority order and then returns to process 1340 to read the next event 362.

When the SDS 350 a assigns response queues, the highest priority response records 363 are placed in the HPQ 352 in their order of assigned priority and not on a FIFO basis. Response records 363 of lesser priority, such as responses records requiring a time delay may be placed in the TDQ 535. Responses records 363 of still lesser priority may be placed in the PQ 354. Such response records 363 in the PQ 354 may need to be addressed only on a periodic basis, for example. Response records 363 of the least priority are assigned to the AQ 355 and may be addressed asynchronously as the higher priority response queues permit. Further, response records 363 are placed into one of the response queues 353-355 according to a processing priority that is assigned by the SDS 350 a and may or may not be placed on a FIFO basis. The above described loop (1340, 1360, 1370) continues for as long as there are events 362 in the EVQ 351.

If the EVQ 351 is determined to be empty at determination point 1341, then the workflow service module 310 proceeds to the highest priority response queue (352-355) that contains a response record 363 and reads the highest priority response record (e.g. the first or the next response record), at process 1350. When a response record 363 is read, the workflow service module 310 issues a function call 365 to the SEAM (221-264) referenced in the response record 363 to perform its function on the data indicated in the response record 363 and then exits.

At process 1380, the called SEAM (221-264) consults the SDS 350 a to determine the task to be performed by the SEAM. Although not strictly required for simple SEAM functions such as the Acquire SEAM 221, more complex SEAMs such as the Forecast SEAM 255 or the Coordinate SEAM 252, for example, may have various alternative algorithms or conditional logic that may be performed. As such the SDS 350 a, may direct the SEAM as to which explicit functionality or algorithm to execute.

At process 1390, the designated SEAM performs its function or task on the data associated with the response record 363. Once the SEAM 221-264 performs its function, the method 1300 proceeds to process 1320 where a new event record is generated and placed into the EVQ 351 and the method 1300 repeats.

FIG. 8 is conceptual diagram of an exemplary diagnostic processing system 1600 for fault conditions by a DDR 45 at a node (120-160) (See, copending application Ser. No. 13/273,784 for details concerning a DDR). Exemplary EHM1 1610 and EHM2 1612 are independent EHM nodes 120 that commonly monitor system varying properties of components (S1, S2) utilizing sensors (1605, 1605′). In this particular example, EHM1 1610 detects a symptom and triggers a fault message 1630 to AHM1 1615.

AHM1 1615 receives the data fault message 1630 and sends a data collection message 1635 to one or more of the EHMs (1610, 1612) to collect additional data concerning the ongoing fault. The collected data is sent back to the AHM1 1615 in a data message 1640, were the collected data is analyzed and/or then forwarded to a higher level node such as VHM 1620 for additional processing and analysis. The AHM1 1615 is configured with the appropriately programmed SEAMS to analyze the fault trigger, prepare data collection requests for specifically targeted nodes (120-160) and forward an analysis of the collected data.

FIG. 9 is an exemplary flow chart illustrating an exemplary diagnostic processing method 1700 for fault conditions by a DDR 45 at a node (120-160) according to embodiments. It will be recognized by those of ordinary skill that the method 1700 is merely one of many different method embodiments that may be executed by the SEAMS (221-264) discussed below or any one of the SEAMS.

The method 1700 begins at point “A” with the workflow service 310 receiving an incoming fault message from a local sensor 1605 and creating an event 1330 that evokes a response to the symptom, which is placed in the event queue 351 for the Allocate SEAM 241 to process. In the interest of brevity, the step-by step process for handling events and response records previously discussed in regard to FIG. 7 will be omitted unless required for clarity.

At process 1710, based on the call from the workflow service 310, the Allocate Module 241 requests from the DDS 350 b a list of active fault conditions that are possibly linked to one or more of the symptoms detected by the Acquire and Decode SEAMs. At this point it is useful to point out that requests for static and dynamic data from the DDS 350 b and the SDS 350 a, respectively are carried out by data accessors 304. Data accessor 304 may be any type of accessor known in the art or developed in the future and will not be described further herein.

At process 1715, the fault condition list is returned by the DDS 350 b. At process 1720, the Allocate Module 241 determines if there are any fault conditions in the list returned by the DDS 350 b that pertain to the reported symptom(s) at decision point 1720. It should be noted that in this example, the Allocate Module 241 is described as performing a limited set of functions. However, those of ordinary skill in the art will appreciate that the Allocate Module 241, like all of the SEAMS (221-264), may contain instructions to perform any number of additional specific functions. If a fault condition is determined to pertain to the reported symptom at decision point 1720, the process terminates at process 1790 where the allocate SEAM 241 provides an “allocation completed” event which the workflow service 310 places in the event queue 351.

At process 1725, when there are no fault conditions that fit the detected symptoms, a new fault condition record is created. The Allocate SEAM 241 queries the SDS 350 a for a fault condition template at process 1730, which is a static software object. The SDS 350 a provides the fault condition template for the construction of the new fault condition.

FIG. 10 is an exemplary method 1800 occurring at process 1725. To create a new fault condition for new/different symptoms being experienced, the Allocate SEAM first locates all of the Failure Modes (FM) already stored in the SDS 350 a that can cause the observed symptom(s) at process 1805. A FM refers to the manner or way in which a failure or fault is observed in terms of the symptoms it generates, the functions it affects, its ultimate cause and the corrective actions required to remove the cause. A key issue is that a FM is a conceptual entity used to describe a class of fault conditions that could occur in the system.

At process 1810, the Allocate SEAM 241 determines a list of all of the unique symptoms produced by the FMs located during process 1805 from the SDS 350 a. Any duplicate symptoms are deleted. A duplicate symptom occurs when more than one FM is known to produce the symptom.

At process 1815, the Allocate SEAM 241 determines all of the tests (e.g., built in test equipment or technician performed tests) that can identify one or more of the unique symptoms determined in process 1810 and at process 1820 the Allocate SEAM determines all of the corrective actions that are associated with the FMs identified in process 1815. At process 1830, the determined FMs, symptoms, diagnostic tests and corrective actions are populated into the fault condition template retrieved at process 1730 and stored in the DDS 350 b.

Returning to FIG. 9, the method then proceeds to process 1735, where a fault condition is associated with a detected symptom. An event is generated and stored in the EVQ 351.

At process 1740, the Allocate Seam examines each Failure Mode in the new Fault Condition to determine it has been identified as a Failure Mode that requires supplemental data collection. In the SDS 350 a one of the attributes of a FM identifies a data collection specification to use if the FM requires supplemental data collection from one or more remote computing nodes. If a data collection specification is identified, then the Allocate SEAM 241 creates a data collection request, which identifies the data collection specification that the remote node should utilize to collect the supplemental data. The data collection specification identifies the data to be collected, the node(s) that should collect the data, the collection period and instructions to process the supplemental data once the supplemental data collection is complete.

At process 1745 a Data Collection Request 1760 is created and stored in the DDS 350 b and the workflow service 310 is notified as discussed above in regard to FIG. 7. The Allocate SEAM generates an event for the Create Data Collection Request in accordance with the process of FIG. 7

At process 1755 the Data Collection Request event is read from the EVQ 351 by the workflow service 310 and the SDS 350 a is subsequently consulted for the appropriate response record 361 which is placed into the appropriate event response queue (352-355) or sub-queue. See, copending application Ser. No. 13/572,518 for additional detail on event and response queue operation and interoperation. Co-pending application Ser. No. 13/572,518 is incorporated herein by reference in its entirety. If supplementary data collection is not required the method proceeds to process 1770 where an Allocation Completed Event is generated and placed in EVQ 351.

At process 1775, the workflow service 310 eventually reads the Data Collection Request response record 361 and invokes the Coordinate SEAM 252. At process 1780, the Coordinate SEAM 252 sends the Collection Request Message to the customized adapter for distribution to any one of a number of other nodes within the complex system to generate and/or report pertinent data concerning the Fault Condition of concern.

Because SEAMS (221-264) do not have the ability to pass information directly from one SEAM to another, the Rank Module 243 requests that same fault conditions list as was requested or created by the Allocate Module 241. At process 1620′, the DDS 350 b returns the fault condition list to the Rank Module 243. At process 1660, the Rank Seam 243 performs the ranking function of a diagnostic reasoner and ranks the fault conditions. The ranking may be accomplished by one or more statistical ranking schemes based on historical analysis, engineering analysis, failure mode analysis or similar analysis methods. Examples of statistical ranking may be a likelihood of specific failure modes, a likelihood of specific test results occurring, a cost/benefit ratio of the fault isolation test, and/or the cost/benefit ratio for corrective action sequences. Once the fault condition is ranked, an event is generated and sent to the workflow service 310 at process 1665.

While at least one exemplary embodiment has been presented in the foregoing detailed description of the invention, it should be appreciated that a vast number of variations exist. It should also be appreciated that the exemplary embodiment or exemplary embodiments are only examples, and are not intended to limit the scope, applicability, or configuration of the invention in any way. Rather, the foregoing detailed description will provide those skilled in the art with a convenient road map for implementing an exemplary embodiment of the invention. It being understood that various changes may be made in the function and arrangement of elements described in an exemplary embodiment without departing from the scope of the invention as set forth in the appended claims. 

What is claimed is:
 1. A condition based method for accumulating supplemental fault condition data for a new fault condition record from a remote nodes in a hierarchical monitoring system, comprising: monitoring an operation of a component of a complex system with a computing node that includes a processor and a memory device, the memory device containing, a configuration file, the configuration file comprising failure modes (FM), symptoms manifested by the FMs, and a data collection specification for each failure mode; and populating at least one of the processor and the memory of the computing node with one or more standardized executable application modules (SEAM) and a workflow service, wherein the one or more SEAMS are unable to communicate directly with each other; wherein further one of the one or more SEAMS is configured to retrieve supplemental data associated with the new fault condition record by: determining all FMs that manifest the symptom of the new fault condition from the memory device, examining a data collection specification associated with each of the FMs determined to identify each determined FM that requires supplemental fault condition data from the remote node, creating a data collection request message to the remote node, receiving supplemental data requested in the data collection request message from the remote node, and incorporating the supplemental data into the fault condition record.
 2. The method of claim 1, wherein the one or more SEAMs includes an Allocate SEAM and a Coordinate SEAM.
 3. The method of claim 2, wherein the Coordinate SEAM is configured to communicate the data collection request message to the remote computing nodes.
 4. The method of claim 3, wherein the data collection specification identifies the supplemental data to be collected, the remote node that that should collect the data, a collection period and instructions to process the data collected. 